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HEPATinS B VmUS INTERACTS 
WITH CELLULAR DNA REPAIR PROCESSES 

This application is a Continiiation-in-Part of United States Application 
5 Serial No. 08/133,932 filed October 11, 1993. 

The present invention was made utilizing funds of the United States 
Government. The United States Government is entitled to certain rights 
under this invention. 

Field of the Invention 
10 This invention relates to methods of treating viral diseases, liver cancer 

secondary to viral diseases, diagnosing viral diseases and specific antibodies 
. against antigens involved in interactions with proteins of viral origin. More 
particularly it relates to the nucleic add sequence for XAP-1 and its amino 
acid sequence of the human XAP-1 protein, which is part of the DNA repair 
15 complex. 

Background 
Hepatitis B Virus (HBV): Clinical Importance 

HBV is worldwide in distribution. It causes acute and chronic liver cell 
damage and hepatocellular carcinoma (HOC; Uver cancer). HBV may cause 

20 long-term persistent infections, with the frequency depending on the age and 
immtmologic status of the host at the time of infection, and genetic factors of 
both the virus and host. Persistent infections often result in seiious Uver 
disease, including cirrhosis and cancer. The prevalence rates of persistent 
infections range from as high as 20% in certain regions of China, Southeast 

25 Asia, and subsaharan Africa to less than 0.5% in North America. It is 
estimated there are approsdmately 300 million persistently infected carriers 



wo 95/10288 



PCr/US94/11451 



2 

of HBV worldwide. An estimated 1 million deaths annually are attributable 
to the harmful effects of HBV infection. At this time, no clinically useful 
antivirals against HBV are available. 

HBV is transmitted in various ways, including from mother to 
5 ofGspring, by contact, and by parenteral and sexual routes. High-risk groups 
include parenteral drug abusers, institutionalized persons, health care 
personnel (surgeons, pathologists, and other physicians, dentists, nurses, 
laboratory technicians, and blood bank personnel), individuals who have 
recently received blood transfusions, hemodialysis patients and staff, highly 

10 promiscuous persons, and newborn infants bom to mothers with hepatitis B. 
A variety of modes of HBV transmission and rates of persistent infections 
exist between endemic countries with high prevalence rates and nonendemic 
coxmtries. In endemic areas, the major routes of infection are perinatal 
transmission from infected mothers to offspring and contact-associated 

15 transmission during the first years of life. 

It is the chronic infection with HBV that is a toajor risk factor of HCC. 
Viral infection usuaUy persists for several decades before the emergence of 
HCC, and the risk of tumor development increases with the duration of 
chronic infection. Chronic carriers are about 200 times more likely to develop 

20 HCC than tminfected persons living in the same area. Nearly 50% of Chinese 
males with chronic HBV infections will eventually stiffer HCC. Analjrses of 
liver tissue from patients with chronic HBV infection have identified both 
replicative intermediates (indicative of active viral replication) and integrated 
forms ofHBVDNA. 

25 HBV Genome and Gene Products 

HBV is a member of the hepadnavirus family, a group of hepatotropic, 
DNA-containing viruses. Additional animal hepadnaviruses include the 
woodchuck hepatitis virus (WHV), groimd squirrel hepatitis virus, and a virus 
of tree squirrels. Avian hepadnaviruses include the duck hepatitis B vims 

30 (DHBV) and DHBV-related viruses from geese, grey herons, and other species 
of ducks. Although minor differences exist, hepadnaviruses are similar in 
morphology, genome structure, and pathogenesis. All hepadnavinises are 
characterized by a restrictive host and tissue tropism and are associated vrith 
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both acute and chronic liver disease. Infectious HBV is present mainly in the 
blood of infected patients^ but it is also found in other body fltiids, such as 
saliva, urine, and semen. The liver is the target organ, but blood cells may 
also be infected. 

The HBV genome consists of partially double-stranded DNA, 
approximately 3,200 base pairs (bp) in length; it represents the smallest 
genome of any virus known to infect man. DNA sequence analyses have 
demonstrated 90—98% nucleotide sequence homology among different HBV 
isolates, and the genome structures of the HBV isolates share many features 
in common with other hepadnaviruses. The full-length (i.e., 3,200 bp) DNA 
strand of the HBV genome is of minus polarity and is complementary to all 
HBV mRNAs. In contrast, the positive strand is complementary at its 5' end 
to the first 224 bp of the negative strand, but has a variable 3' end (ranging 
from 53 to 88% of unit length). 

The organization of the HBV genome is remarkably efiBcient. Analysis 
of different HBV isolates has revealed the conservation of four long open 
reading frames (ORFs) within the genome which encode specific viral 
proteins, including the virus nucleocapsid (core antigen or HBcA^, envelope 
glycoproteins (surface antigen or HBsAg), polymerase (product of the P gene), 
and a protein from the X gene. 

HBV Replication 

Replication of HBV begins with the attachment of the virus particle to 
susceptible cells, such as hepatocytes. Following entiy into hepatocytes, the 
core component is released from the virion, and the partially double-stranded 
DNA genome is converted to a covalently closed circular (CCC) form that can 
be detected in the cell nucleus. The CCC DNA then serves as template for 
the production of HBV mRNAs. The 3.5-kb pre-genome transcript is 
encapsidated, via a packaging signal located near the 5' end of the RNA, into 
newly synthesized core particles where it serves as template for the HBV 
reverse transcriptase. An RNase*H-like activity within the HBV polymerase 
removes the RNA template, as the negative-strand DNA is ^thesized. Viral 
replicative intermediates, consisting of fidl-length minus-strand DNA plus 
variable-length (20-^0%) positive-strand DNA, are encapsidated within the 
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core particle during normal virus replication. Virus replication is completed 
as the DNA-containing core particles bud from the cell surface. 

The integration of HBV DNA into chromosomal DNA, although not a 
part of normal viral replication, occurs during chronic infection with HBV. 
5 Integrated HBV can be detected in most but not all HCCs that arise in 
HBV-positive individuals and is usually present in 1—4 genome copies per cell. 
The percentage of Uver tumors that are HBV positive approaches 95% in 
high-HBV-endemic areas such as China, and viral DNA may be detected in 
approximately 80-^% of HBV-related HCCs from other regions of the world. 

10 The analysis of many HBV inserts cloned from HCCs has established that a 
preferred site for recombination of viral DNA with cellular DNA is located 
near the DR region of the viral genome. As replicative intermediates share 
structural features with many integrated HBV forms (i.e., one end near the 
DR region, with variable lengths of the positive-strand DNA), it is possible 

15 that replicative intermediates may serve as the template for integration of 
viral DNA. 

Integration of HBV DNA is not required for viral replication, so the 
viral DNA insertions detected in the chromosomal DNA of HCCs most 
probably occur via illegitimate recombination. Such recombinational events 

20 are thought to occur at random locations. Gro£» chromosomal alterations are 
frequently observed at the site of viral integration and include deletions, 
duplications, and translocations. The effect of these chromosomal alterations 
on the hepatocyte will depend on the identity of genes nearby the insertion 
site and the extent to which gene expression is altered Although HBV 

25 integration occurs at random within chromosomes, there is a subsequent 
nonrandom selection for cells containing viral integration events in specific 
chromosomes during the evolution of a tumor. Of 28 HBV inserts cloned 
from HCCs and mapped to himian chromosomes, 61% have been assigned to 
chromosomes 3, 11, 17, and 18. When considering the mutagenic consequence 

30 of HBV integration, the data suggest that the disruption of genes on 
chromosomes 3, 11, 17, and 18 may be particularly important in the genesis 
of subsets of HCCs. 
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HBVXProtein 

The X ORF represents the smallest ORF of the HBV genome and has 
the potential to encode a protem 146 to 154 amino acids in length, depending 
on the isolate of HBV. The X ORF overlaps with parts of the P and pre-C 
5 ORFs and encompasses several regulatoiy regions of the genome, including 
both DRs, enhancer II, and the C gene promoter. The X ORF peptide 
sequence is highly conserved among different viral isolates. Although 
detection of the X protein within HBV-infected cells remains diSicult, the 
presence of anti-X protein antibodies in the sera of HBV*infected patients 

10 provides evidence that the X ORF is expressed during natural infection. 
Antibodies to X are detected most often in patients with chronic hepatitis, 
liver cirrhosis, and/or hepatoma. 

DNA transfection approaches iising the cloned X gene have clearly 
demonstrated that the X gene product can transactivate a wide variety of 

15 viral and cellular promoters. The heterogeneity of the elements responsive 
to X suggests that the X gene product exerts its effect by a mechanism other 
than direct binding to a specific DNA sequence, probably functioning via 
effects on cellular factors. The ability of the X gene to transactivate the 
expression of HBV RNA suggests an important regulatoiy role for X during 

20 viral rephcation. Indeed, animal studies have indicated the requirement for 
X expression for virus growth in vivo. 

Specific functions attributable to X protein have been difBcult to 
pinpoint. In addition to being a transactivator, X protein has been reported 
to be a serine protease inhibitor, and to activate the cellular protein kinase 

25 C signalling pathway. It has been suggested that the X protein may affect 
other celMar processes, besides transcription, that are regulated by cellular 
kinases. 

Cellular DNA Repair Mechanisms 

Accidental lesions occur continually in the DNA of eucaryotic cells. 
30 DNA damaging agents include thermal fluctuations, ultraviolet (UV) 
irradiation, environmental toxins, and man-made chemicals. It is estimated 
that thousands of DNA nucleotides are damaged in a cell every day by such 
chemical processes. If unattended, this would quickly lead to unacceptably 
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high rates of mutations in germ cells (which would affect maintenance of the 
species) and in somatic cells (which would adversely affect the individual). 
Uncontrolled cell proliferation (i.e., cancer) is one deleterioxis outcome of the 
accumulation of genetic changes in a cell. 
5 Colls have a complex system for recognizing and repairing damage in 

the DNA, in order to maintain the fidelity of the genome. This process is 
called DNA repair. Based on yeast genetic studies^ it is estimated that 
eucaiyotic cells contain more than 50 genes involved in DNA repair functions^ 
reflecting the great importance of this process to the cell. 

10 Individuals with the autosomal recessive genetic disease called 

xeroderma pigmentosum (XP) have a defect in the DNA repair system. Such 
individuals are sun-sensitive (displaying an abnormal sensitivity to UV 
radiation) and have a marked predisposition toward skin cancer (2000-fold 
increased frequency). Somatic cell fusion experiments using cells derived 

15 from XP patients have defined seven complementation groups (A— G), 
suggesting that a multienz}nne complex is involved in efficient DNA repair. 
XP patients are defective in the 1[}ulky lesion" repair mechanism that is 
believed to be responsible for scaiming and repairing large changes in the 
structure of the DNA double helix. 

20 Two groups have described a protein that binds to damaged DNA that 

may represent the defective protein in XP group E patients (Takao et al.^ 
Nucleic Acid Res. 21:4111-4118, 1993; Hwang and Chu, Biochemistry 
32:1657-1666, 1993). The monkey gene 'TJV-DDB" was cloned by Takao et 
a/.; the human cognate of the UV-DDB gene was mapped to chromosome 11. 

25 Hwang and Chu purified the protein (XPE-binding factor) from human 
placenta. Both proteins are about the same size (125—127 kDa) and have a 
high binding affinity for damaged DNA. Both groups conclude their isolate 
is probably involved in the recognition step of the excision repair pathway. 
However, as the proteins difier in some biochemical properties, it is not yet 

30 clear if the two groups have recovered the same protein. 

A number of yeast excision repair genes (RAD) and rodent DNA-repair 
genes (ERCC) have been isolated. It has recently been shown that the RAD2, 
ERCC75, and XPGC genes are equivalent. This emphasizes the striking 
homology between eukaiyotic nucleotide-excision repair genes. 
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Of interest is the recent observation that repair protein XPBC/ERCC3 
is part of the basal transcription factor TFIIH. This suggests that the same 
cellular proteins are involved in two different aspects of DNA metabolism: 
initiation of transcription and excision repair. 

5 Summary of the Invention 

An object of the present invention is a method for diagnosing viral 
infection. 

An object of the present invention is a method for diagnosing cancer 
secondary to viral infection. 
10 A further object of the present invention is the provision of a nucleic 

acid sequence for XAP-1. 

Another aspect of the present invention is the provision of an amino 
acid sequence for the XAP-1 protein. 

Another object of the present invention is a method for monitoring 
15 viral infection or cancer secondary to viral infection. 

Another aspect of the present invention is the use of the XAP-1 gene 
as the object of screening assays to detect genetic alterations indicating a 
possible elevated risk of developing cancer. 

Thus, in accompUshing the foregoing objects, there is provided in 
20 accordance with one aspect of the present invention a method of treating viral 
diseases in an animal or hiunan, comprising the step of interfering with the 
interaction of a viral protein with a DNA repair complex in the animal or 
human to be tested. This same procedure can be used also to treat liver 
cancer secondary to viral infection. Other aspects of the invention include a 
25 nucleic acid sequence coding for the XAP-1 protein as well as the amino acid 
sequence for the XAP-1 protein and antibodies which bind to the XAP-1 
protein or bind to the separate peptides of the XAP-1 protein. 

The antibodies can be used to detect the infection, detect the cancer, 
monitor the effectiveness of therapy or monitor the stages of the cancer or 
30 infection. 

Other objects, features and advantages will be apparent and more 
readily imderstood from reading the following specification and by references 
to the accompanying drawings forming a part thereof. Examples of the 
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presently preferred embodiments of the invention are given for the purpose 
of disclosTire. 

Brief Description of the Drawings 
Figure 1 shows the transcriptional activation by reconstitution of GAL4 
5 activity. (A) Transcription of ZocZ (detected using chromogenic substrate 
X-GAL) does not occur when GAL4 does not bind GAL4 upstream activator 
sequence (UASc). (B) GAL4, which contains a DNA-binding domain (BIND) 
and a transcription activation domain (ACT), binds to UASq which results in 
activation of transcription. Bait (0) or prey (D) plasmids alone do not induce 
10 transcription. (E) Interaction of bait protein X and prey protein Y results in 
transcriptional activity. 

Figure 2 shows the nucleic acid sequence which encodes the human 
XAP-1 protein. 

Figure 3 shows the amino acid sequence of the XAP-1 protein. 
15 Figure 4 shows the plasmid preciirsor to the bait plasmid pAS-l-X. 

The GAL4 DNA-binding domain (amino acids Nos. 1-147) are fused in frame 
with X ORF. The Trp gene is used as a selective marker. 

Figure 5 shows the prey plasmid pACT. The GALA activation domain 
(amino acid Nos. 768-881) is fused with a cDNA library made from 
20 ElBV-transformed human lymphocytes. The Leu gene is used as a selective 
marker. The host cell is the yeast strain Y153 and the LocZ gene and His3 
gene are controUed by GAL4 responsive elements. 

Figure 6 shows the immunoprecipitation of labeled yeast extracts with 
anti-X antiserum. Lane one is the negative control. Lane two is the 
25 immimoprecipitate from yeast cells co-expressing X + LBP (Laminin-binding 
protein). Lane three is the immunoprecipitate from yeast cells co-expressing 
X and XAP-1. Anti-X antibody co-precipitated a 140 KD protein. 

Figure 7 shows the hydrophilicity plot for XAP-1. Underlined regions 
1 and 2 refer to peptides 1 and 2 used to generate antipeptide antibodies in 
30 rabbits. 

Figure 8 shows that XAP-1 synthesized in vitro in rabbit reticoilocyte 
lysates primed with XAP-1 mRNA is recognized by anti-peptide antisera in 
inmumoprecipitation tests. Lane 1 is a sample of the radiolabeled lysate; 
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XAP-1 migrates at about 127 kD. Lane 2 is a sample of lysate reacted with 
control serum* Lanes 3 and 4 are samples of Ijrsate reacted with anti-peptide 
2 serum and anti-peptide 1 senmi, respectively. 

Figure 9 shows the expression of XAP-1 in HEpG2 cells (human 
5 heptoma-derived cells). Panel A is a Northern blot showing that XAP-1 
mRNA is 4.4 kb in size. XAP-1 protein is expressed in the HEpG2 cells 
(Panels B and C). Cells were metabolically labeled with ["S]methionine, 
extracted, and immimoprecipitated with anti-peptide 2 antiserum (Panel B). 
XAP-1 (127 kD) plus several associated cellular proteins were recovered. 

10 When unlabeled HEpG2 cells were extracted, reacted with anti-peptide serum, 
and then anal3rzed in an immimoblot reaction using anti-peptide serum, 
XAP-1 protein was detected (Panel C). 

Figure 10 shows the association between X and XAP-1 when X is 
expressed as a GST fusion protein. Lane 1 is radiolabeled in vitro translation 

15 of XAP-1. The translation mixtures containing labeled XAP-1 were applied 
to gluthathione-sepharose beads containing immobilized control GST protein 
(lane 2) or GST-X fusion protein Gane 3). Boimd protein was eluted and 
analyzed by SDS-PAGE. Note that XAP-1 was recovered using the GST-X 
fusion protein. 

20 Figure 11 shows the procedxire used in the experiment in Figure 10 for 

verifying the interaction between X and XAP-1. 

Detailed Description 
It will be readily apparent to one skilled in the art that various 
substitutions and modifications may be made to the invention disclosed herein 
25 without departing from the scope and spirit of the invention. 

As used herein, DNA repair function refers to the general steps of DNA 
repair, including: (1) recognition of a lesion in the DNA; (2) incision of the 
damaged strand on both sides of the lesion; (3) excision of the damaged 
nucleotides; (4) synthesis of new DNA by copying the "good" complementary 
30 strand as template; and (5) ligation (sealing) of the nick left in the repaired 
strand. The end result of DNA repair is the restoration of two good copies 
of the DNA sequence. 
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As used herein, DNA repair complex refers to the chemical components 
involved in the intricate system of recognizing and repairing damage to DNA, 
particularly the mnltienzyme complexes that have been found to be associated 
with the recognition of a lesion in DNA, incision and removal of damaged 
5 nucleotides, synthesis of new DNA and ligation of the nick in the repaired 
strand. 

Gene therapy, as used herein, refers generally to the method of 
treating an individual with genetic material to treat a disease or other 
pathophysiological condition. 

10 As used herein, cancer secondary to HBV infection refers to the 

occurrence of carcinomas and other cancers in an individual as the result of 
infection with Hepatitis B Virus. 

One aspect of the present invention is a method of treating viral 
disease in a non-human animal or a himian, comprising the step of interfering 

15 with the interaction of a viral protein with a DNA repair complex in the 
animal or human to be treated. One specific example woiild be the treatment 
of Hepatitis B Virus (HBV) viral disease in non-human animals and hiunans. 
In this procedure, the interaction of the X protein of HBV with the XAP-1 
protein of the DNA repair complex is inhibited. Thus, acute and chronic 

20 infections may be treated and possibly cured by the use of inhibitory 
substances to block the interaction of X protein with the cellular DNA repair 
machinery. Such an inhibitory substance and approaches can include, but are 
not limited to: 

(a) Antiviral drugs to reduce expression of X protein. 
25 (b) A decoy synthetic peptide mimicking the interactive domain of either 
X or XAP-1 (or related proteins) to block their interaction. 

(c) Gene therapy vectors encoding such blocking peptides. 

(d) Antisense constructs or inhibitory oligonucleotides directed against 
HBV sequences that woiild block the synthesis of X protein. 

30 (e) Intracellular antibodies able to block the interaction of X with XAP-1 
or other components of the repair complex. 
(D Chemicals to enhance the level of nuclease activity or other activities 
of the repair process so that the repair pathway would not be 
overwhelmed by HBV DNA and X protein. 
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(g) Substances that would prevent the migration of HBV cores to the 
intracellular location where they contact the DNA repair mechanism. 
One skilled in the art readily recognizes that these approaches can be 
used to inhibit the interaction of a viral protein with the DNA repair complex. 
5 Another alternative embodiment includies the method of treating cancer 

secondary to viral infection in an animal or hxmian, comprising the steps of 
interfering with the interaction of a viral protein with a DNA repair complex 
in the animal or himian to be treated. One skilled in the art readily 
understands that in treating the cancer, one can reduce, delay or prevent the 

10 effects of viral protein on the cellular DNA repair system. This would allow 
the repair process to monitor and repair chromosomal DNA damage, 
preventing the accumulations of mutations in the cell. In one specific 
example of the present invention, liver cancer secondary to HBV infection can 
be treated by inhibiting the interaction of the X protein of HBV with the 

15 XAP-1 protein of DNA repair complex. 

Another embodiment is the use of the XAP-1 gene in diagnostic tests 
to identify those at increased risk of developing cancer. If the XAP-1 gene 
were to be mutated spontaneously or by esqiosure to viral infection or 
environmental factors, the loss of the normal DNA repair function might 

20 predispose cells to carcinogenic changes. Genetic errors would rapidly 
accumulate in those cells, some of which would affect important 
growth-regulatoiy genes. Similarly, individuals with inherited mutations in 
the XAP-1 gene mi^t be cancer-prone. Diagnostic tests based on the 
detection of altered forms of the XAP-1 gene or gene product would be useful 

25 to identify those at increased risk of cancer development. At-risk individuals 
could then be monitored to detect cancers at their earliest stage, when 
treatment is most successful. 

Althou^ the specific examples above for treatment of disease and for 
treatment of cancer involve the HBV, other viral infections that induce 

30 cellular DNA damage or that involve the DNA repair system at some stage 
in the virus host interaction may be prevented, suppressed, modulated and/or 
cured using an inhibitory substance to block the interaction of specific viral 
proteins with the DNA repair mechanisms of the host cell, and similar to that 
described for HBV and HBV X protein. This can include members of the 
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adeno-y arena-, bunya-, calici-, corona-, flavi-, hepadna-, herpes-, ortho-, 
papova-, paramyxo-, parvo-, picoma-, pox-, reo-, retro-, rhabdo-, and togavirus 
families, viroids, and agents not yet classified, and specifically would include 
hepatitis C virus. It would include target tissues in any organ system in the 
5 body, including but not limited to liver, gastrointestinal tract, respiratory 
system, skin, blood and blood-forming tissues, and the brain and nervous 
system. 

Another approach is the use of gene therapy. These approaches 
include blocking HBV X function and its interactions with cellular DNA 

10 repair components. One skilled in the art also readily recognizes that gene 
therapy approaches can be directed to XAP-1, related proteins, and other 
components of the multienz5ane DNA repair complex to modulate its ability 
to correct DNA damage induced by toxic chemicals, environmental toxins, 
viruses or other infectiotis agents, pharmaceuticals used to treat other 

15 conditions, or inherited genetic defects including mutator phenotypes. 

Another embodiment of the present invention is the XAP-1 gene 
nucleic acid sequence which encodes for the XAP-1 protein, shown in 
Figure 2. 

Another embodiment of the present invention is the amino acid 
20 sequence of the XAP-1 protein which is shovm in Figure 3. 

Polyclonal or monoclonal antibodies can be made against two synthetic 
peptides derived from the sequences of XAP-1, as well as to the whole XAP-1 
protein or other fragments of the XAP-1. This procedure includes preparing 
the polyclonal or monoclonal antibodies against the synthetic peptides frx>m 
25 the sequence, against authentic intact protein purified from eucaryotic cells 
or protein expressed using various express systems, including not only 
modified or unmodified intact protein, but the fragments of the protein also. 
Elxamples of these antibodies are shown in the examples of antibodies 
prepared against peptide 1 and peptide 2. 
30 In a specific embodiment of the present invention, the antibodies are 

used for diagnostic purposes. These antibodies can be used to identify defects 
in xeroderma, pigmentosiun patients, stage HBV infections or liver cancer 
development and to monitor the effectiveness of various therapeutic 
treatments using immunohistochemical, immunofluorescence. 
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immunoprecipitation, immunoblot, enzyme-linked immunosorbent assa3rs, or 
other immxmological methods. 

Example 1 

Identification of HBV X Protein Interactive Cellular 
5 Proteins Usmg the Yeast Two-Hybrid System 

Background about system: Understanding the functions of the HBV 
X protein reqtiires a knowledge of the protem-protein interactions that occur 
in different target cells and at different stages in the viral life qycle. The 
difBculty encountered so far in detecting X protein synthesized in 

10 HBV-infected human ceUs or in transgenic mouse cells has precluded studies 
of cellxilar proteins able to complex with the viral polypeptide. Even when 
foimd, X may well be an imstable, low-abimdance protein which will further 
complicate studies of protein-protein interactions. The present invention 
utilizes the yeast two-hybrid system to circiraivent these problems. 

15 The two-hybrid system was described by Fields and Song, Nature 

340:245-246, 1989, and modified by EUedge, [Durfee et al. Gems and Dev. 
7:555-569, 1993]. The procedure is a genetic method to identify and clone 
genes that interact with a protein of interest using in vivo complementation 
in yeast. The system rehes on the properties of the yeast GAL4 protein, 

20 which contains two separable domains responsible for transcriptional 
activation and DNA binding. Plasmids encoding two hybrid proteins, one 
consisting of the GAL4 DNA-binding domain fiised to protein of interest X 
(bait) and the other consisting of the GAL4 transcription activator domain 
fused to test protein Y (prey), are introduced into yeast. Interaction between 

25 bait and prey proteins leads to transcriptional activation of a reporter gene 
containing a binding site for GAL4. In principle, a mammalian cDNA library 
(prey plasmids) can be quickly screened provided neither participant (bait or 
prey) is self-activating. A schematic depicting transcription activation using 
the two-hybrid system, vidth facZ as reporter gene, is shown in Figure 1, The 

30 two-hybrid system has several advantages over other commonly used methods 
to stu^ protein-protein interactions: It is more sensitive than 
immxmoprecipitation; ptirified proteins are not required; and it can be used 
to isolate genes encoding proteins that are normally expressed at a low level. 
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The bait component of the tu)o-hybrid system: The claning vector was 
pASl, which carries as a selective marker TRPl (Figure 4). A bait plasmid 
that expressed GAL4 DNA-binding domain— X fusion protein was constructed 
using the HBV X gene. Saccharomyces cerevisiae host strain Y153 was used. 
5 Transformed cells were plated on synthetic complete (SC) media lacking 
tryptophan. As stable propagation of bait plasmids in yeast does not 
guarantee expression of the cloned test gene, HBV X protein expression was 
verified by immimoprecipitation of ^^labeled proteins from sonically lysed 
yeast cells, using our X antisera. 

10 Bait proteins that self-activate reporter genes cannot be used. The 

suitability of the X bait plasmids for the two-hybrid system was tested using 
a lezcZ reporter assay and a 3-aminothiazole (3-AT) susceptibility assay. The 
lacZ repoiter assay checks for activation of the yeast -galactosidase reporter 
gene. Transformed yeast cells were boimd to a reinforced nitrocellulose filter, 

15 permeabilized in Hquid nitrogen, and then incubated overnight in buffer 
containing X-gal; the colonies turn blue if lacZ is activated. The X plasmid 
did not display self-activation. 

The prey component of the two-h^ridsysteni: To search for interactive 
proteins, an activation-domain tagged cDNA expression library was 

20 co^transformed into yeast expressing the HBV X bait plasmid. Prey plasmids 
were derived from a lambda-phage (k) cDNA expression vector (X-ACT) 
(Figure 5). X-ACT encodes a plasmid that automatically excises, 
recircularizes, and propagates when grown in the proper E. coli strain, 
obviating the need for subcloning ^-amplified cDNAs. The cDNA library used 

25 was derived from Epstein-Barr virus-immortalized human lymphoqrtes. 

Recipient yeast containing bait plasmids were transformed with library 
DNA using a lithium sorbitol transformation protocol. Library transformed 
cells were placed for 3 hr in liquid SC media lacking His, Trp, and Leu to 
establish transformants and activate HISS transcription and then plated onto 

30 solid selective media. Colonies that grew after 3—5 days were tested using the 
lacZ reporter assay. Blue colonies were kept for further analysis. 

Screens to rule out false positives: Some blue colonies able to grow on 
selective media may not be true positives due to specific bait and prey protein 
interactions. Approaches to rule out false positives included the nonspecific 
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interactor test. In this test, false-positives were ruled out by determining if 
they reacted nonspecifically with a panel of indicators expressed as bait fusion 
proteins. The panel contained lamin c, hiiman qrclin D, truncated p53, and 
yeast protein SNFl. Prey fusion proteins that interacted with some or all 
5 panel proteins were considered to be false positives. One true positive that 
presimiably encoded an X-associated protein was designated XAP-1 (X- 
associated protein 1). The laminin-binding protein and the beta subunit of 
G protein are partiaUy X-specific. 

Identification of HBV X interactive proteins: True positive prey 

10 plasmids were DNA sequenced and an identity scan performed using a 
databank search. Althou^ the XAP-1 gene was not represented on the first 
screen, it later appeared in GenBank (accession no. L20216) as a related 
sequence for a UV-damaged DNA-binding (UV-DDB) protein recovered from 
a monkey cell cDNA library. The monkey UV-DDB gene is about 98% 

15 homologous to the human XAP-1 gene at both the nucleotide and amino acid 
levels. (Fifty-one nucleotide mismatches and one amino acid mismatch.) 

Preparation of immunological reagents against XAP-l: We selected 
hydrophihc sequences from XAP-1, based on analysis of the gene sequence, 
and prepared anti-peptide antisera to be used for subsequent protein 

20 characterization (Figure 7). The peptides used were the following: peptide 
1 - REKEFNKGPWKQENVE (amino adds 198-213); peptide 2 - 
QYDDGSGMKREATA (amino acids 1113—1126). Antibodies were raised that 
recognized XAP-1 synthesized in vitro in rabbit reticulocyte lysates primed 
with XAP-1 mRNA, as well as endogenous protein synthesized in human 

25 hepatoma-derived HEpG2 cells (Figures 8 and 9). 

Proof of proteinrprotein irUeraction: Two approaches were used to 
verify HBV X protein interaction with XAP-1. Yeast cells expressing the two 
proteins were labeled with [^]methionine, extracted, and the lysate 
precipitated with anti-X antibody; GAL4— XAP-1 fusion protein was 

30 co-inmiunoprecipitated (Figure 6). In a second approach to prove binding, X 
was expressed as a glutathione S-transferase (GST) fusion protein in E, colL 
The GST-fusion protein was immobilized on glutathione-sepharose beads, and 
*S-labeled in vitro translation mixtures containing the XAP-1 protein were 
applied to the coliunn. Boimd protein was eluted and analyzed by 
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SDS-PAGE. XAP-1 was recovered (Figure 10). An outline of the procedure 
is shown in Figure 11. 

Example 2 
Model of Significance to HBV Infections 
5 And Disease Pathogenesis 

The model is that HBV usurps a normal DNA repair pathway in the 
cell to convert its partially douhle-stranded DNA genome to a covalently 
closed circular form, a necessary step in the virus life cycle. This step must 
occur with each progeny DNA molecule that is generated, either in newly 

10 infected cells, or as new roimds of replication are initiated in persistently 
infected cells. A cellular process presumably accomplishes this step; the viral 
polymerase fimctipns to reverse transcribe pregenomic RNA to a partial 
double-stranded DNA, and inhibition of viral polymerase activity reportedly 
had no effect on generation of CCC DNA. Primer removal, and plus-strand 

15 completion must be early steps in the initiation of new rounds of replication. 
The steps necessary to convert the incomplete HBV genome to an intact 
infectious form are reminiscent of the repair process designed to correct 
damaged cellular DNA. 

It is believed that the HBV X protein is required for the recruitment 

20 and/or functioning of the cellular repair, proteins to repair the HBV genome 
or to trigger a series of changes that makes the hepatocyte more permissive 
for viral repUcation. The involvement of the X protein is to direct the cellidar 
complex from a cellular site to HBV DNA, to stabilize the interaction of the 
multienzyme complex with HBV DNA, to displace or inhibit one or more 

25 enzymatic processes that are part of the normal DNA repair process that 
might be deleterious to HBV DNA, to enhance the activity of the complex to 
make it more beneficial for HBV, and/or to alter the function of the repair 
process in some other way. 

Therefore, if the involvement of the cellular DNA repair mechanism in 

30 the repair and completion of the HBV genome is blocked, the repUcation of 
HBV is inhibited. It is thus possible to eliminate chronic infections with 
HBV, as well as to treat acute infections. 
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This model helps explain the role of HBV in Uver cancer. The DNA 
repair proteins are in close proximity to chromosomal DNA and, as the 
genomic HBV DNA is being repaired, the viral DNA is, on occasion, 
accidentally integrated into the chromosome. If an important cellular gene 
5 were interrupted or affected, this could have profoimd effects on the cell. 
Inhibiting this on-going repair process of HBV DNA would decrease the 
niunber of integration events of HBV DNA and reduce the likelihood of 
eventual development of HCC. In addition, the involvement of X protein in 
redirecting the DNA repair process may be efBcient enough that it interferes 

10 with normal repair of chromosomal DNA, resulting in the accimiulation of 
mutations and genetic instability, eventually leading to cancer development. 
Therefore, blocking the interaction of X with the cellular repair mechanism 
will decrease the development of HCC. 

This model explains several other poorly imderstood observations in 

15 the HBV S3rstem. The proposed function of X as a protease inhibitor might 
serve to d^nreguJate the cellular DNA repair complex by inhibiting a specific 
cleavage of a member of the complex in order to accentuate a function 
especially beneficial to HBV repair. As it now appears that some of the same 
cellular proteins are involved in initiation of transcription, excision repair, 

20 and DNA replication, this could explain the function of X protein as a 
promiscuous transcriptional activator. The X protein may be altering to a 
greater or lesser extent components of the transcription complex at the same 
time that it is modifying the DNA repair mechanism to meet the needs for 
HBV DNA repair. These effects on cellular processes would occur in cells 

25 expressing X protein from an integrated HBV genome, even in the absence of 
replicating HBV, because of the ability of X protein to interact with the 
cellular target proteins. The X protein has never been observed to exhibit 
DNA-binding activity. It is probable that X protein interaction with XAP-1 
and possibly other members of the repair complex is necessary to enable X to 

30 affect the DNA-based processes. 

Although the involvement of the X protein is required for the 
recruitment of the repair complex to repair HBV DNA, it is not imexpected 
that X has been reported to be dispensable for virus rephcation in tissue 
culture. Elstablished cells in culture frequently contain mutations and display 
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altered patterns of gene expression; redundant biochemical pathways are 
expressed that would be normally repressed in hepatoqrtes in vivo. There are 
many examples of viral genes that are essential for viral growth in a host 
organism that are not necessary for viral growth in tissue cxilture cells. 
5 All patents and publications mentioned in this specification are 

indicative of the levels of those skilled in the art to which the invention 
pertains. All patents and puhUcations are herein incorporated by reference 
to the same extent as if each individual publication was specifically and 
individually indicated to be incorporated by reference. 

10 One skilled in the art will readily appreciate that the present invention 

is well adapted to carry out the objects and obtain the ends and advantages 
mentioned, as well as those inherent therein. The methods, procedures and 
techniques described herein, as well as the antibodies, nucleic acid sequence 
and amino acid sequences are presently representative of the preferred 

15 embodiments, are intended to be exemplary, and are not intended as 
limitations on the scope. Changes therein and other uses which are 
encompassed within the spirit of the invention or defined by the scope of the 
appended claims will occiir to those skilled in the art. 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
ATGTCGTACA ACTACGTGGT AACGGCCCAG AAGCCCACOG CCGTGAACGG CTGCGT6ACC 60 
6GACACTTTA CTTCG6CC6A AGACTTAAAC CTGTT6ATTG CCAAAAACAC 6A6ATTA6AG 120 
ATCTATGTGG TCACCGCCGA GGGGCTTCGG CCTGTCAAAG AGGTGGGCAT GTATGGGAAG 180 
ATTGCGGTCA T6GAGCTTTT CAGGCCCAAG GGGGAGAGCA AGGACCTGCT GTTTATCTTG 240 
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ACA6C6AAGT 


ACAATGCCTG 


CATCCTGGAG 


attacgcgag 


^^^^^ Ik fn A M M n ih 

CCCATGGCAA 


TGTCCAGGAC 


ATTGGCATCA 


TTGACCCTGA 


GTGCCGGATG 


AAGGTTATTC 


CACTA6ATCG 


06ATAATAAA 


GA6CTGCATG 


TCATTGATGT 


CAAGTTCCTA 


GTCTACCAGG 


ACCCTCAGGG 


GCGGCACGTA 


6AATTCAATA 


AGGGCCCTTG 


GAAACAGGAA 


GCAGTCCCAG 


AGCCCTTT66 


GGGGGCCATC 


AATGGTGACA 


AATACCTGGC 


TATTGCCCCT 


CACAATCGAG 


TGGACCCTAA 


TGGCTCAA6A 


TTCATGCT6C 


TTTTGGAGAA 


GGAGGAACAG 


CGTGTAGAAC 


TCCTTG6AGA 


GACCTCTATT 


GTTGTGTTTG 


TOGGGTCTCG 


CCTGGGTGAC 


AAT6AACAAG 


GCTCCTATGT 


AGTGGCCATG 


GATAT6TGCG 


TGGTGGACCT 


G6AGAG6CAG 


6CTTTCAAGG 


AAGGTTCTTT 


60GGATCATC 


AGCATTGACT 


TACGAGGCAT 


CAAAG6ATTA 


ACTGATGAGA 


CTTTGGTGCT 


CTCTTTTGTG 


GAGGAGGTAG 


AAGAAACCGA 


ACTGATGGGT 


GGCAA06TGG 


CTCATCAGCA 


GCTTATCCAG 


CAAGAACCCA 


AAGCTCTGGT 


CAGTGAATGG 


GCCTCCTGCA 


ATA6CAGCCA 


GGTGGTGGTG 


ATCXaiTCCTC 


AGGAGCTCCG 


GCAGATCAGC 


TTGGACATCA 


CCCGATTAGG 


AGACAGCAAT 


TGGACGGAGA 


TCTCGGCTCG 


TATCTTGAAG 


ATGCTGGGTG 


6AGAGATCAT 


TCCTOGCTCC 


TACCTCCTTT 


GTGCCTTCGG 


AGATGGAGCG 


G6TCTGTTGA 


GOGACOGTAA 


GAAGG7GACT 


TTTOGTTCTC 


TTTCTACCAC 


CAAC6TCTTT 


A6CAGCAACC 


ACAAATTGGT 


CTTCTCAAAT 


CCCCTCAATT 


CA6AT6GCTA 


TCCTGACAGC 


ATTGGCACCA 


TOGATGAGAT 


CCAGAAGCTG 


CCAAGGAAGA 


TCTGCTACCA 


GGAAGTGTCC 


GAAGTCCAAG 


ACACGAGTGG 


GGGCAOGACA 



20 

TATAAACAGA GTGGCGAGAG CATTGACATC 300 
CGCATTGGCC GCCCCTCA6A GACC6GCATT 360 
ATTGGCCTGC GTCTCTATGA TGGCCTTTTC 420 
GAACTCAAGG CCTTCAACAT CCGCCTGGAG 480 
TATGGTTGCC AAGCACCTAC TATTTGCTTT 540 
AAAACCTATG AGGTGTCTCT CCGAGAAAAG 600 
AATGTCGAAG CTGAAGCTTC CATGGTGATC 660 
ATCATTGGAC AGGAGTCAAT CACCTATCAC 720 
CCTATCATCA AGCAAAGCAC 6ATTGT6TGC 780 
TACCTGCTGG GAGACATGGA AGGCCGGCTC 840 
ATGGATGGCA CCGTCACTCT CAAGGATCTC 900 
GCTGAGTGCT TGACATACXTT TGATAATGGT 960 
TCCCAGCTTG TGAAGCTCAA CGTTGACAGT I02D 
GAAACCTTTA CCAACTTAGG ACCCATTGTC 1060 
GGGCAGGGGC AGCTGGTCAC TTGCTCTGGG U40 
CGGAATGGAA TTGGAATCCA CGAGCATGCC 1200 
TGGCCACTGC G6TCTGACCC TAATCGTGAG 1260 
GGCCAGACAA GAGTTCTCAT GTTAAATGGA 1320 
TTCGTGGATG ATCA6CAGAC TTTCTTCTGT 1380 
ATCACTTCAG CATCGGTGA6 GTTGGTCTCT 1440 
AAGGAGCCTC AGGCCAAGAA GATCAGTGTG 1500 
GCT6TAGGCA G6GCCCTCTA CTATCTGCAG 1560 
CACACAGAGA TGGAACATGA A6TGGCTTGC 1620 
6GACTGTCCC CTCTTT6TGC CATTGGCCTC 1680 
TTGCCCTCTT TTGAACTACT GCAGAAGGAG 1740 
ATCCTGATGA CCACCTTTCA GAGTAGCCAT 1800 
CTTTTCTACT TTGGGCTCAA GATTGAGACA 1860 
TTGGGCACCC AGCCCAC06T ATTGAGGACT 1920 
GCTTGTTCTG ACCGCCCCAC TGTCATCTAT 1980 
GTCAACCTCA AGGAAGTGAA CTACATGT6T 2040 
CTGGCGCTGG CCAACAATAG CACCCTCACC 2100 
CACATTOGCA CAGTTCCCCT CTATGAGTCT 2160 
CAGTGTTTCG GGGTCCTCTC CAGCCGCATT 2220 
GCCTTGAGGC CCAGC6CTAG CACCCAGGCT 2280 
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Tr*P AfiO & 


V X \* X AAu^ X w 


OAfSr'AAnPTG 

^mAW^^AAW W X w 


TTCTCCAGCA 


GCACTGCTCC TCATGAGACC 


2340 


X U w X X X vunw 


AAU A^TU X Va^A 


^SrSTfSPAPAAP 

WW X W WA^^AAW 


CTACTTATCA 

WX**W^ .AcbX Ws« 


TTGACCAACA 


CACCTTTGAA 


24G0 


uXvwX XwAX V 


UrWV^AV^V^AU X X 


•PmYSf ARAAT 

X ^ X WWAWAAX 


GAATATGCCC 


TCAGTCTGGT 


TTCCTGCAAG 


2460 


\^ X vxv vWnnnu 


A OfV*n A Af!AP 
A^^\>*WAAV«A%M> 


X X Aw X X A 


GTGGGCACAG 


CAAT6GTGTA 


TCCTGAAGAG 






AW V^AU\9\9 X 


WJIX XWXWWX W 


TTTCAGTATT 


0GGAT6GAAA ACTACAGACT 


2580 


ly X X \vnA/i 


Ar2/: A AR1VS A A 
Avv AAU X V AA 


AWWWW\«^^WX w 


TACTCTATGG 

xnwx wx«»x 


TGGAATTTAA 


CGGGAAGCTG 


2640 


X X AGCUAv 


T/* A ATAnr*Ar* 

X WaA X A\>^JAV.» 


WW X W WWWW X Vir 


TATGAGTGGA 

X AXWAWXWWA 


CAACAGAGAA 


GGAGCTGCGC 


2700 


X vAu X vL*A 


A Of A f*T A n A A 
AwwA^ X AwAA 


PAAPATPATC 

WAA^^\X WAX W 


CCCCPCTACC 


TGAAGACCAA 


GGGCGACTTC 


2760 


AX^^XuvXuw 


fiOft APPTT AT 

VWUAw^X XAX 


GCGCTCAGTG 


CTGCTGCTTG 


CCTACAAGCC 


CATGGAAGGA 


2820 


X X XA9/\Av 


AGATTCCTCG 

AW A A X WW A WW 


ACACTTTAAT 

^uvn\^x X xnn A 


CCCAACTGGA 


TGAGTGCTGT 


GGAAATCTTG 


2880 


U A X V A X l3 A wA 


AX X X xwxwww 


GGCT6AAAAT 

W X ^fCWU* X 


GCCTTTAACT 


TGTTTGTGTG 


TCAAAAGGAT 


2940 


ACCGCTGCCA 




G6A6C6GCAG 


CACCTCCAGG 


AGGTTGGTCT 


TTTCCACCTG 


3000 


GGi^ AG XXX. l> 


X L>AAX w X ^ X X 


X Xw^ Wiwww\^ 


TPTPTRGTAA 
X\^XwXWWXAA 


TGCAGAATCT 


GGGTGAGACT 


3060 


TCCACCCCCa 


CACAAwwLTXL. 


GGTGCTCTTC 


ft/IP A^WTPA 
wwV«AlaAvw X wA 


ACGGCATGAT 


A6GGCTGGTG 




ACCTCACTGT 


CAGA6AGCTG 


GTACAACCTC 


CTGCTGGACA 


TGCAGAATCG 


ACTCAATAAA 


3180 


GTCATGAAAA 


GTGtGGGGAA 


GATOGAGCAC 


TCCTTCTGGA 


GATCCTTTCA 


CACCGAGOGG 


3240 


AA6ACA6AAC 


CAGCCACAGG 


TTTCATCGAC 


GGTGACTTGA 


TTGAGA6TTT 


CCTGGATATT 


3300 


AGCC6CCCCA 


AGATGCAGGA 


GGTGGTGGCA 


AACCTACAGT 


ATGAOGATGG 


CAGCGGTATG 


3360 


AA6CGAGA66 


CCACTGCAGA 


CGACCTCATC 


AAGGTTGTGG 


AGGAGCTAAC 


TOGGATCCAT 


3420 



TAG 3423 
(2) INFORMATION FOR SEQ ID NO:2: 

<i) SEQUENCE CHARACTERISTICS: 

(A) I.ENGTH: 1140 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Net Ser Tyr Asn Tyr Val Val Thr Ala Gin Lys Pro Thr Ala Val Asn 
15 10 15 

Gly Cys Val Thr Gly His Phe Thr Ser Ala Glu Asp Leu Asn Leu Leu 
20 25 30 

lie Ala Lys Asn Thr Arg Leu Glu lie Tyr Val Val Thr Ala Glu Gly 
35 40 45 

Leu Arg Pro Val Lys Glu Val Gly Met Tyr Gly Lys lie Ala Val Met 
50 55 60 
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Glu Leu Phe Arg Pro Lys Gly Glu Ser Lys Asp Leu Leu Phe lie Leu 
65 70 75 80 

Thr Ala Lys Tyr Asn Ala Cys lie Leu Glu Tyr Lys Gin Ser Gly Glu 
85 90 95 

Ser lie Asp lie lie Thr Arg Ala His Gly Asn Val Gin Asp Arg He 
100 105 110 

Gly Arg Pro Ser Glu Thr Gly He He Gly He He Asp Pro Glu Cys 
115 120 125 

Arg Met He Gly Leu Arg Leu Tyr Asp Gly Leu Phe Lys Val He Pro 
130 135 140 

Leu Asp Arg Asp Asn Lys Glu Leu Lys Ala Phe Asn He Arg Leu Glu 
145 150 155 160 

Glu Leu His Val He Asp Val Lys Phe Leu Tyr Gly Cys Gin Ala Pro 
165 170 175 

Thr He Cys Phe Val Tyr Gin Asp Pro Gin Gly Arg His Val Lys Thr 
ISO 185 190 

Tyr Glu Val Ser Leu Arg Glu Lys Glu Phe Asn Lys Gly Pro Trp Lys 
195 200 205 



Gin Glu Asn Val Glu Ala Glu Ala Ser Met Val He Ala Val Pro Glu 
210 215 220 

Pro Phe Gly Gly Ala He He He Gly Gin Glu Ser He Thr Tyr His 
225 230 235 240 

Asn Gly Asp Lys Tyr Leu Ala He Ala Pro Pro He He Lys Gin Ser 
245 250 255 

Thr He Val Cys His Asn Arg Val Asp Pro Asn Gly Ser Arg Tyr Leu 
260 265 270 

Leu Gly Asp Met Glu Gly Arg Leu Phe Met Leu Leu Leu Glu Lys Glu 
275 280 285 

Glu Gin Met Asp Gly Thr Val Thr Leu Lys Asp Leu Arg Val Glu Leu 
290 295 300 

Leu Gly Glu Thr Ser He Ala Glu Cys Leu Thr Tyr I«eu Asp Asn Gly 
305 310 315 320 

Val Val Phe Val Gly Ser Arg Leu Gly Asp Ser Gin Leu Val Lys Leu 
325 330 335 

Asn Val Asp Ser Asn Glu Gin Gly Ser Tyr Val Val Ala Met Glu Thr 
340 345 350 

Phe Thr Asn Leu Gly Pro He Val Asp Met Cys Val Val Asp Leu Glu 
355 360 365 

Arg Gin Gly Gin Gly Gin Leu Val Thr Cys Ser Gly Ala Phe Lys Glu 
370 375 380 

Gly Ser Leu Arg He He Arg Asn Gly He Gly He His Glu His Ala 
385 390 395 400 

Ser He Asp Leu Pro Gly He Lys Gly Leu Trp Pro Leu Arg Ser Asp 
405 410 415 



wo 95/10288 



PCr/US94/11451 



10 
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Pro Asn Arg Glu Thr Asp Aap Thr Leu Val Leu Ser Phe Val Gly Gin 
420 425 430 

Thr Arg Val Leu Met Leu Asn Gly Glu Glu Val Glu Glu Thr Glu Leu 
435 440 445 

Met Gly Phe Val Asp Asp Gin Gin Thr Phe Phe Cys Gly Asn Val Ala 
450 455 460 

His Gin Gin Leu lie Gin lie Thr Ser Ala Ser Val Arg Leu Val Ser 
465 470 475 480 

Gin Glu Pro Lys Ala Leu Val Ser Glu Trp Lys Glu Pro Gin Ala Lys 
485 490 495 

Asn lie Ser Val Ala Ser Cys Asn Ser Ser Gin Val Val Val Ala Val 
500 505 510 

Gly Arg Ala Leu Tyr Tyr Leu Gin lie His Pro Gin Glu Leu Arg Gin 
515 520 525 

lie Ser His Thr Glu Met Glu His Glu Val Ala Cys Leu Asp He Thr 
530 535 540 



20 



25 



30 



35 



40 



Pro Leu Gly Asp Ser Asn Gly Leu Ser Pro Leu Cys Ala He Gly Leu 
545 550 555 560 

Trp Thr Asp He Ser Ala Arg He Leu Lys Leu Pro Ser Phe Glu Leu 
565 570 575 

Leu His Lys Glu Met Leu Gly Gly Glu He He Pro Arg Ser He Leu 
580 585 590 

Met Thr Thr Phe Glu Ser Ser His Tyr Leu Leu Cys Ala Leu Gly Asp 
595 600 605 

Gly Ala Leu Phe Tyr Phe Gly Leu Asn He Glu Thr Gly Leu Leu Ser 
610 615 620 

Asp Arg Lys Lys Val Thr Leu Gly Thr Gin Pro Thr Val Leu Arg Thr 
625 630 635 640 

Phe Arg Ser Leu Ser Thr Thr Asn Val Phe Ala Cys Ser Asp Arg Pro 
645 650 655 

Thr Val He Tyr Ser Ser Asn His Lys Leu Val Phe Ser Asn Val Asn 
660 665 670 

Leu Lys Glu Val Asn Tyr Met Cys Pro Leu Asn Ser Asp Gly Tyr Pro 
675 680 685 

Asp Ser Leu Ala Leu Ala Asn Asn Ser Thr Leu Thr He Gly Thr He 
690 695 700 

Asp Glu He Gin Lys Leu His He Arg Thr Val Pro Leu Tyr Glu Ser 
705 710 715 720 

Pro Arg Lys He Cys Tyr Gin Glu Val Ser Gin Cys Phe Gly Val Leu 
725 730 735 

Ser Ser Arg He Glu Val Gin Asp Thr Ser Gly Gly Thr Thr Ala Leu 
740 745 750 

Arg Pro Ser Ala Ser Thr Gin Ala Leu Ser Ser Ser Val Ser Ser Ser 
755 760 765 
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I*ys Leu Phe Ser Ser Ser Thr Ala Pro His Glu Thr Ser Phe Gly Glu 
770 775 780 

Glu Val Glu Val His Asn Leu Leu lie lie Asp Gin His Thr Phe Glu 
785 790 795 800 

Val Leu His Ala His Gin Phe Leu Gin Asn Glu Tyr Ala Leu Ser Leu 
805 810 815 

Val Ser Cys Lys Leu Gly Lys Asp Pro Asn Thr Tyr Phe 11^ Val Gly 
820 825 830 

Thr Ala Met Val Tyr Pro Glu Glu Ala Glu Pro Lys Gin Gly Arg lie 
835 840 845 

Val Val Phe Gin Tyr Ser Asp Gly Lys Leu Gin Thr Val Ala Glu Lys 
850 855 860 

Gin Val Lys Gly Ala Val Tyr Ser Met Val Glu Phe Asn Gly Lys Leu 
865 870 875 880 

Leu Ala Ser lie Asn Ser Thr Val Arg Leu Tyr Glu Trp Thr Thr Glu 
865 890 895 

Lys Glu I«eu Arg Thr Glu Cys Asn His Tyr Asn Asn lie Met Ala Leu 
900 905 910 

Tyr Leu Lys Thr Lys Gly Asp Phe He Leu Val Gly Asp Leu Met Arg 
915 920 925 

Ser Val Leu Leu Leu Ala Tyr Lys Pro Met Glu Gly Asn Phe Glu Glu 
930 935 940 

He Ala Arg Asp Phe Asn Pro Asn Trp Met Ser Ala Val Glu He Leu 
945 950 955 960 

Asp Asp Asp Asn Phe Leu Gly Ala Glu Asn Ala Phe Asn Leu Phe Val 
965 970 975 

Cys Gin Lys Asp Ser Ala Ala Thr Thr Asp Glu Glu Arg Gin His I«eu 
980 985 990 

Gin Glu Val Gly Leu Phe His Leu Gly Glu Phe Val Asn Val Phe Cys 
995 1000 1005 

His Gly Ser Leu Val Met Gin Asn Leu Gly Glu Thr Ser Thr Pro Thr 
1010 1015 1020 

Gin Gly Ser Val Leu Phe Gly Thr Val Asn Gly Met He Gly Leu Val 
1025 1030 1035 1040 

Thr Ser Leu Ser Glu Ser Trp Tyr Asn Leu lieu Leu Asp Met Gin Asn 
1045 1050 1055 

Arg Leu Asn Lys Val He Lys Ser Val Gly Lys He Glu His Ser Phe 
1060 1065 1070 

Trp Arg Ser Phe His Thr Glu Arg Lys Thr Glu Pro Ala Thr Gly Phe 
1075 1080 1065 

He Asp Gly Asp Leu He Glu Ser Phe Leu Asp He Ser Arg Pro Lys 
1090 1095 nop 

Met Gin Glu Val Val Ala Asn Leu Gin Tyr Asp Asp Gly Ser Gly Met 
1105 1110 1115 1120 
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Lys Arg Glu Ala Thr Ala Asp Asp Leu lie Lys Val Val Glu Glu Leu 
1125 1130 1135 

Thr Arg He His 
1140 
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CLAIMS 

WHAT IS CLAIMED IS: 

1. A method of treating viral disease in an animal or human, 
comprising the step of interfering with the interaction of a viral protein with 

5 a DNA repair complex in the animal or hiunan to be treated. 

2. The method of claim 1, wherein the viral disease is caused by 
the Hepatitis B Virus (HBV) and the interaction of the X protein of HBV with 
XAP-1 of the DNA repair complex is inhibited. 

3. A method of treating cancer secondary to viral infection in an 
10 animal or human, comprising the steps of interfering with the interaction of 

a viral protein with a DNA repair complex in the animal or human to be 
treated. 

4. The method of claim 3, wherein the liver cancer is secondary to 
HBV infection and the interaction of the X protein of HBV with XAP-1 of the 

15 DNA repair complex is inhibited. 

5. A method of identifying persons at increased risk of developing 
cancer, comprising detecting alterations in the XAP-1 gene of an individual. 

6. The method of claim 5, wherein said XAP-1 gene alterations are 
detected by one or more of the following techniques: restriction enzyme 

20 analysis, polymerase chain reaction assays, nucleic acid hybridization, and/or 
using any of various immimological assays. 

7. The nucleic acid sequence of XAP-1 protein shown in Figure 2, 

8. The amino acid sequence of XAP-1 protein shown in Figure 3. 



9. An antibody which binds to the protein of claim 6. 
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10. The antibody of claim 7 which binds to peptide 1, or peptide 2 
or both. 

11. A method of detecting patients with HBV infection, comprising 
the step of combining a sample from said patient with the antibody of claim 

5 7 and measuring the amoimt of antibody-antigen interaction. 

12. A method of monitoring the effectiveness of therapeutic 
treatments, comprising the step of combining a sample from said patient with 
the antibody of claim 9 and measuring the amount of antibody-antigen 
interaction. 

10 13. A method of determining the stage of liver cancer secondary to 

HBV infection, comprising the step of combining a sample from said patient 
with the antibody of claim 9 and measuring the amoimt of antibody-antigen 
interaction. 
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FIGUBE2(l/2) 

ATGTCGTACAACTACmXjGTAACGGaXAGAAGCXXACCXXXXnXj 

GCGTGACX^GGACACTTTACTTCXKKXGAAGACTrAAAC^ 

AACACGAGATTAGAGATCTATGTGGTCACCXKXXjAGGG GCTrO GG^ 

AAGAGGTGGGCATGTATGGGAAGATTGCGGTCATGGAGCnr^ 

GGGGGAGAGCAAGGACCTGCTGTTTATCTIGACAGCXjAAGTACAATG<^^ 

ATCCrcGAGTATAAACAGAGTCGCXjAGAGCATTGACATCATTAOGOT 

ATCGCAATXJTCCAGGACJCGCATKXKXXXXXX:^ 

CATCATTGACXXnX3AGTGCXXXSATGATTGG<XTGC^^ 

TCAAGGTTATTXXTVCTAGATaKXjATAATAAAGAACTCAAGGa^^ 

CXXKXnXKIAGGAGCIXXIATGTCATTGATGTCA^ 

CACCTACTATITGCrrTGTCTAIXAGGACOCTCAGGGGCXXX^ 

TATGAGGTGTCTClOaSAGAAAAGGAATrCAATAAGGGCCXnTGGAA^ 

AAAATGTCGAAGCTOAAGCTTCXIATGGTGATCGCAGIXXX^ 

GGGGGCCATCATCATTGGACAGGAGTCAATCACCTATCACAA^ 

TACXnX3GCrATTGa:CCTCCrATCATCAAGCAAAGCACGAT^^ 

TCX}AGTGGACCXn'AATGGCrcAAGATA<XIXXnGGGAGACAT^ 

CTCTTCATGCrGCrmGGAGAAGGAGGAACAGATGGATGGCAC^ 

CAAGGATCrcCXnxn'AGAACTCXTIGGAGAGACXnXTATraCro 

CATACXnTGATAATGGTGTTGTGTITGTCXXXnX^^ 

Cn-GTGAAGCTCAACGTTCACAGTAATOAACAAGGCTXXTATG^^ 

TGGAAAOTITACXIAACTrAGGACXXWrTGTCXSATATGTG^^ 

GAGAGGCAGGGGCAGGGGCAGCTGGTCACTTGCrcnXSGGGCITr^ 

GTTCTTTGCXKSATCATCCXKjAATGGAATTGGAATC^ 

GACTTACCAGGCATCAAAGGATrATGGCCAClXXXXnxnGACXCT 

agactgatgacactttggtgctctcttttgtggg 
ttaaatggagaggaggtagaagaaacx:gaactgatgggtitogtggatgatc 

AGCAGACTTTCnrClOTGGCAACXnXSGCTCATCAG 

TCAGCATCGGTGAGGTTOGTCIUTCAAGAACXCAAAGCnOG^ 

GGAAGGAGCCTCAGGCCAAGAACATCAGTGTGG<XrrOCTGC^ 

GGTGGTGGTGGCnX3TAGGCAGGG0a:rCrACTATCnX^ 

AGCTCCGGCAGATCAGOCACACAGAGATGGAACATCAAGTGGCTrGCT^^ 

CATCACOOCATTAGGAGACAGGAATGGACIXnXXXXnUITTGl^^ 

TCTGGACXKjACATCTCGGCTCXjrATCTrGAAGTTXX^ 

CACAAGGAGATGCTGGGTGGAGAGATCATIXXTXXXnm^T(XTGATC 

CXrrTGAGAGTAGCCATTAOCnXXnTIGTGanTGGGAGATGGAG 

TACTTTGGGCrcAACATTGAGACAGGlCTGTTCAGCXSACCXn-AAGAAGC^ 

CrrTGGGCACXX:AGCCCACX33TATTGAGGACTIT^ 

ACGTCTITGCTroTTCTGACCGCOCX:ACTGT^ 

TTGGTCTTCTCAAATGTCAACCTCAAGGAAGTGAACT^ 

TTCAGATGGCTATCCTCACAGCCTGGCGCTGGCCAACAATAGCACXXITC^ 

ATTGGCACCATCGATGAGATOCAGAAGCTGCACATTCGCACAGTR^^ 

TGAGTCia:AAGGAAGATCIGCrA(XAGGAAGTGTCCCAG^^ 

CTCTXX:AGCXXX:ATroAAGlXX:AAGACACGAGTGGGGGCACGACAGCC^ 

GGC(XAGCGCTAGCACa:AGGCTCTGT<XAGCAGTGTAAGCrCCAGCAAGCT 

GTTCTCCAGCAGCACTGCnXTCATGAGACCT CCTTT GGAGAAGAGGTGGAG 

GTCCACAACCTACrTATCATrGACCAACACACXnTrGAAGTGCnTC 
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FlGUBE2(2/2) 

(XAGTTTCrcCAGAATGAATATGICCCrCAGT^ 

AAGACXXXIAACACnTACnTCATTGTGGGCACAGG^ATGCTO 

GGCAGAGCCGAAGCAGGGTCGCATTGTGGTCirrcAGTATT^^ 

CTACAGACrcTGGCTGAAAAGGAAGTGAAAGGGGCXX5TXn*ACrCT^ 

AATrTAAaKK5AAGCIXnTAG(XL\GCATCAATAGCACXK5TCC^^ 

GTGGACAACAGAGAAGGAGCTGCGCACTCAGTGCAACCACrACAACAACAT 

CATGGaxnXTACCIGAAGACCAAGGGCXJACnTGATa^^ 

ATGa3CrcAGTGCrGCTX}CTrGCCTACAAGCCCATGGAATO 

AGATTGCltXjAGACTrTAATCXXIAACroGATO 

GATGACAATITrcrGGGGGCrGAAAATGCX:TTTAACTIGT^^ 

GGATAGCXKnXKX:ACX:ACIGACXjAGGAGCGGCAGCAOTXX:AGGAG<^^ 

aillC CACCnXXXKXSAGTTTGTCAATGTCTTrT^^ 

CAGAATCIGGGTGAGACTTOCA(XCXX:ACACAAGGCn^ 

CX3GTCAACXKK:ATGATAGGGCTGGTGACCTCACrGT^ 

CCTCCTGCTGGACATGCAGAATCGACTCAATAAAGTCAT^^ 

AAGATXX5AGCACTCXnTCTGGAGATOCrrrCACV^CX3GAG^ 

CAGCCACAGGTTTCATCGACGGTGACTTGATTGAGAGTTTCCT^ 

OGCCCX:AAGATGCAGGAGGTGGTGGCAAAOCTACAGTATGACXjATGG^ 

GTATGAAGCGAGAGG0CACroCAGACXJACXnx:ATCAAGGTTXnXK5AGGAGCT 

AACrCGGATOCATTAG 
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FIGURE 3 

MSYNYVVTAQKPTAVNCK::VTGHFreAEDLNIXIAKNT^^ 

VGNlYOmVMElJrRPKGESKDUmTAKYNACILEYKQSGESIDirrRAHG^ 

RIGRPSETGnOITOPECRNnGLM.YDGLI'KVIPLDMJNKELKAFN^^ 

PLYGOQAPTICFVYQDPQGRHVKTYEVSLREKEFNKGPWKQENV^^ 

PEPFGGAmGQESITYHNGDKYLAIAPPIIKQST[VC3mVDPNGSRYIl^ 

UmilJEKEEQMIXnVIIJmLRVELLCTTOIAECL^^ 

ViaLNVDSNE(K5SYWAMETFTNIXa'IVDMCVVDLmQG<KKJ 

IJR]IRNGIGIHEHASmUH3IK(H,WIUlSDF!NBEID^ 

NISVASa^SSQVWAVGRALYYIXJIHPQELRQISinEMEHEVACIJ>ITEL^ 

Ii»FLX:AIGLWroiSARILKLPSFELLHKEMLGGEI^ 

A11WiI2«ETGIJLSDRKKVTIjGTQPTVUITO 

LVFS^^^NUCBVNYMCPU^SIX5YPDSIJ^IANNSTLTIG^ 

RKICYQEVSQCKSVI^SRIEVQDTSGGTrAUU'SASTQAI^VSSSn-FSSSTAPH 

ETSFGEEVEVHNLLinXJHlFEVlJEiAHQFLQNEYALSLVSCKl^ 

MVYPEEAEPKQGRIVVFQYSDGKLQTVAEBCBVKGAVYSMVEFNGKLI^ 

RLYEWTTEKELRTECNHY^I^^MALYUCtKGDFILV(^ 

EEIAIU>FNFNWMSAVEIIJDDD^IFLGAE^IAFNIPVOQ 

U«LGEFVNVFCHGSLVMQNlXiETSTPlXK}SVIJ<^^ 

LLDMQNMliKVKSVGKIEHSFWRSFHlERKTEPATGFI^ 

EVVANUJYD1X5SGNIKREATADDLIKVVEELTRIH 
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FIGUBE4 
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Figure 8 
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1. clone X ORF into GST fusion vector pGEX-2TK 

2. transfomt E. coll 

3. induce the expression of GST-X In bacterial culture 

4. harvest cell pellet 

5. incubate sonicated cell iysate with glutatiiione beads 

6. wash beads to purify GST-X protein 

7. mix with radio-labelled XAP-1 (synthesized In vitro) 

8. wash beads and elute the associated proteins 

9. SDS-PAGE and autoradiography 

figure: u 
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